Execution validation using header containing validation data

ABSTRACT

The present invention adds a procedure to the operating system file subsystem of a processing system that significantly reduces the amount of time necessary to verify the validity of executable files. Each executable is extended with a file signature containing a header containing validation data. This header may be added to an existing ELF header, added as a new section, or placed in a file&#39;s extended attribute store. The header contains results of all previous validation checks that have been performed. The file signature is inserted, with a date stamp, into the file attributes. On execution, the system checks the previously-created file signature against a current file signature, instead of creating the file signature for every file during the execution process. Checks to ensure that the file signature is secure, and is valid and up to date, are also implemented. Only if the file signature is not valid and up-to-date does the execution program create a new file signature at the time of execution.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation-in-part (CIP) of U.S. application Ser. No. 11/024,914, filed Dec. 28, 2004, the contents of which are fully incorporated herein by reference.

BACKGROUND OF THE INVENTION

1. Field of the Invention

This invention relates to validation of executable files.

2. Description of the Related Art

Among all computing and networking security issues, the most important cause of concern does not come from intrusions, but from the widespread proliferation of viruses. Viral infections represent the great majority of all security incidents and consume massive amounts of time and resources in their detection and in correcting problems associated with the execution of undetected viruses on a system.

In their most basic form, viruses and other unwanted programming instructions manifest themselves in unwanted code and/or programs inserted into files that executed on a computer system. The various manners in which such unwanted instructions are copied into computer systems are well known and are thus not discussed further herein.

To combat the problem of viruses and other unwanted code finding their way onto and executing on computer systems, virus scan programs have been developed to identify the existence of viruses on a system. In addition, executable programs have been developed which are digitally signed, whereby a kernel checks the digital signature each time the executable file is to be run, refusing to allow it to run if the signature is not valid.

Virus scan programs have been developed to identify the existence of viruses on a system. Each file on a computer system is scanned and a check sum or hash value (or simply “hash”) is created for each file. A hash, also called a message digest, is a number generated from a string of text. The hash is substantially smaller than the text itself, and is generated by a formula in such a way that is extremely unlikely that some other text will produce the same hash value. Hashes play a role in security systems where they are used to ensure that transmitted messages have not been tampered with. The sender generates a hash of the message, encrypts it, and sends it with the message itself. The recipient then decrypts both the message and the hash, produces another hash from the received message, and compares the two hashes. If they are the same, there is a high probability that the message was transmitted intact. The concept of hashing is well known and is not discussed further herein.

The file name and the hash value are compared to a virus signature file which contains information regarding all known viruses as of the date the virus scan program was last updated. If a match is found (i.e., if the file name, or elements of the file itself (it's hash matches a known virus hash), correspond to the name or elements of a known virus), the file containing the match is quarantined and rendered inoperable, repaired, or deleted. The virus signature files are updated periodically, e.g., weekly or more frequently if needed as new virus files are discovered. This requires users to run a complete virus scan on all files each time the virus signature files are updated.

Virus scan programs take a long time to perform their scanning and checking process. For each 10 Gigabytes of memory, it can take approximately 30 minutes to complete a scan and check operation. As the size of hard drives increase, and with the increase in size of software images due to multimedia content, such as MP3 and digital pictures, the problem of increased scan time is only getting worse. The scanning operation itself uses significant system resources and thus delays other operations that a user is attempting to perform. Further, with the proliferation of mobile laptop devices, it is often impossible to schedule virus scans during off hours, as can be done with desktop systems that are never turned off, since laptop systems are typically turned off when not in operation.

The digital signatures on the executables can be of two types: symmetric or asymmetric. A symmetric signature uses a secret key to key a message authentication code (MAC), taken across the entire content of the executable file. Symmetric signatures can be verified with relatively little overhead, but the key must remain secret, or the attacker can forge valid signatures. This makes symmetric signatures useful only in the local case. In addition, the key must be kept secret on the local machine, and this is very difficult to do.

An asymmetric signature uses a public key signature pair, such as with the RSA signature scheme. In this case the private key is used to sign, and the public key is used to verify the signature. The private key needs to remain secret, but need exist only on the signing system. All other systems can verify the signature knowing only the public key, which need not be secret. Thus executables signed with asymmetric signatures are muc more flexible, as the signed executable can be widely distributed, while the signing remains centralized. Unfortunately, public key signature verification have much higher overhead than symmetric ones.

Accordingly, it would be desirable to have a system and method for decreasing the time and overhead associated with verifying executable files.

SUMMARY OF THE INVENTION

The present invention adds a procedure to the operating system file subsystem of a processing system that significantly reduces the amount of time necessary to verify the validity of executable files. Each executable is extended with a file signature containing a header containing validation data. This header may be added to an existing ELF header, added as a new section, or placed in a file's extended attribute store. The header contains results of all previous verification checks that have been performed. The file signature is inserted, with a date stamp, into the file attributes. On execution, the system checks the previously-created file signature against a current file signature, instead of creating the file signature for every file during the execution process. Checks to ensure that the file signature is secure, and is valid and up to date, are also implemented. Only if the file signature is not valid and up-to-date does the execution program create a new file signature at the time of execution.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a flowchart illustrating an example of the basic steps utilized to create a file signature correlated to the creation of a hash value when a file is being written to, in accordance with the present invention; and

FIG. 2 illustrates an example of steps performed, in accordance with the present invention, when executing a file containing the novel file signature.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

FIG. 1 is a flowchart illustrating an example of the basic steps utilized to create a file signature correlated to the creation of a hash value when a file is being executed, in accordance with the present invention. At step 100, the process begins for executing an executable file. All previous validation methods are performed at this time, and the results of this check are inserted into a validation header, along with a hash of the file, and the MAC of the header. At step 102, a “validation timestamp” associated with the time and date at which the validation process is performed, is created.

At step 104, a hash value is created for the file being validated at essentially the same time as the validation process has been completed. To avoid problems with slight delays between the actual time that the validation process completes and the actual time that the creation of the hash value is completed, a predetermined +/− factor can be applied when making comparisons described below.

At step 106, a hash value timestamp is created associated with the creation time and date of the hash value. At step 108, the hash value itself, along with the hash value timestamp are inserted into the attribute (header) of the file that was just written to. The process is then completed.

FIG. 2 illustrates an example of steps performed, in accordance with the present invention, when executing a file containing the novel signature of the present invention.

At step 200, the latest validation signature file(s) are downloaded for use by the execution software. As is well known, this assures that the validation process is performed using the most currently available validation definitions and other related files.

At step 202, the validation process commences. At step 204, for the next file on the hard drive to be read, the hash and timestamp fields, are read from the file. At step 206, the hash timestamp is compared with the write timestamp to make sure that neither the hash value nor the file being scanned have changed since the last valid write of the file. This protects against a “smart virus” that can insert itself into a file after the valid write to the drive has been completed. Essentially, this assures that the file has not been changed since the hash value and hash value timestamp were inserted into the header field.

At step 208, if it is determined that the hash timestamp and the write timestamp are different (or are not within the +/− factor indicated above), then a new hash value is created on the fly, prior to proceeding to step 212. If, however, at step 208 it is determined that the hash timestamp and the write timestamp are the same (or essentially the same), the process proceeds directly to step 212.

At step 212, the hash value is compared with the validation signature file in a well known manner to determine if there are any “matches” indicating the existence of validation-related elements in the hash value. If, at step 214, there are no matches found, the file is determined to be valid at step 216, and the process proceeds directly to step 220.

However, if, at step 214, it is determined that there is a match between the hash value and the validation signature file, this indicates the presence of a contamination or invalid file of some kind, and the process proceeds to step 218 where the invalidity is acknowledged and corrective measures are taken in a well known manner. The process then proceeds to step 220, where it is determined if there are more files to be validated. If, at step 220, it is determined that there are more files to be validated, the process proceeds back to step 204 where the above steps are again performed. If, at step 220, it is determined that there are no more files to be validated, the process ends.

As noted above, using the present invention, the operating system file of a computer system is modified to create the hash during each file write. This hash is timestamped and may be encoded with a security chip such as the Trusted Platform Module or any other well known industry crypto processor. The security chip contains a monotonic counter (can only count forward, never can be reset or pulled backwards) which is used within the well known algorithm to create the timestamp. This prevents an intelligent virus from creating an incorrect timestamp.

Using the present invention, the validation process, which is extremely time-consuming, is performed on a file-by-file basis, whenever the file is executed. This spreads out the creation of hash values over the course of the use of the computer, rather than at a specific point in time when a file execution is being performed. A digital signature (hash) is created and stored within each file attribute, which are stored in the header field in front of the actual data. The hash represents the file signature, the TPM or security chip then encrypts the hash with the monotonic counter to timestamp it. The write is also timestamped and encrypted. This way, a smart virus cannot fake any of this data.

When the verification process is performed, in most cases the majority of the files will have a write timestamp and hash timestamp that match, indicating that the stored hash table can be used for the verification process. Accordingly, at the time of running the verification process, most of the time will be consumed comparing the hash table to the verification definitions rather than creating the hash tables.

The above-described steps can be implemented using standard well-known programming techniques. The novelty of the above-described embodiment lies not in the specific programming techniques but in the use of the steps described to achieve the described results. Software programming code which embodies the present invention is typically stored in permanent storage of some type, such as permanent storage of a workstation located on a hard drive, flash memory, etc. In a client/server environment, such software programming code may be stored with storage associated with a server. The software programming code may be embodied on any of a variety of known media for use with a data processing system, such as a diskette, or hard drive, or CD-ROM. The code may be distributed on such media, or may be distributed to users from the memory or storage of one computer system over a network of some type to other computer systems for use by users of such other systems. The techniques and methods for embodying software program code on physical media and/or distributing software code via networks are well known and will not be further discussed herein.

It will be understood that each element of the illustrations, and combinations of elements in the illustrations, can be implemented by general and/or special purpose hardware-based systems that perform the specified functions or steps, or by combinations of general and/or special-purpose hardware and computer instructions.

These program instructions may be provided to a processor to produce a machine, such that the instructions that execute on the processor create means for implementing the functions specified in the illustrations. The computer program instructions may be executed by a processor to cause a series of operational steps to be performed by the processor to produce a computer-implemented process such that the instructions that execute on the processor provide steps for implementing the functions specified in the illustrations. Accordingly, FIGS. 1-2 support combinations of means for performing the specified functions, combinations of steps for performing the specified functions, and program instruction means for performing the specified functions.

Although the present invention has been described with respect to a specific preferred embodiment thereof, various changes and modifications may be suggested to one skilled in the art and it is intended that the present invention encompass such changes and modifications as fall within the scope of the appended claims. 

1. A method for increasing the operational efficiency of an executable-file validation program configured to perform a validation operation during execution of one or more executable computer files, comprising: creating a file signature for each execution process associated with each of said computer files; comparing each file signature with a validation signature during said validation operation; and based on said comparison, determining if any of said executable computer files are invalid or valid.
 2. The method of claim 1, wherein said each file signature created during the execution process includes a hash value corresponding to its associated computer file.
 3. The method of claim 2, wherein each file signature created during the execution process further includes a hash value time stamp corresponding to the time that the hash value corresponding to its associated computer file was created.
 4. The method of claim 3, wherein each file signature created during the execution process is inserted into a file header of its associated computer file.
 5. The method of claim 4, wherein said comparing step includes: comparing the hash value of each computer file to be verified with the validation signature file, whereby: if the hash value of one of said computer files matches one or more elements of said validation signature file, a determination is made that that computer file is invalid; and if the hash value of one of said computer files does not match one or more elements of said validation signature file, a determination is made that that computer file does not contain a virus.
 6. The method of claim 5, wherein said comparing step further includes: comparing a write time stamp of each computer file to be validated with the hash value time stamp of its associated file signature, whereby: if said write time stamp and said hash value time stamp indicate the essentially simultaneous occurrence of the validation process associated with said write time stamp and the creation of the hash value in its associated file signature, then the hash value associated with the file being validated is used for the validation process; and if said write time stamp and said hash value time stamp indicate the validation process associated with said write time stamp and the creation of the hash value in its associated file signature did not occur essentially simultaneously, then the hash value associated with the file being validated is not used for the validation process, and a new hash value is created, on the fly, for use in conducting the validation process of that computer file.
 7. A system for increasing the operational efficiency of a validation program configured to perform a validation process operation on one or more executable computer files, comprising: means for creating a file signature for each execution process associated with each of said computer files; means for comparing each file signature with a validation signature during said validation process; and means for determining if an invalid execution process exists in any of said computer files, based on said comparison.
 8. The system of claim 7, wherein said each file signature created during the validation process includes a hash value corresponding to its associated computer file.
 9. The system of claim 8, wherein each file signature created during the validation process further includes a hash value time stamp corresponding to the time that the hash value corresponding to its associated computer file was created.
 10. The system of claim 9, wherein each file signature created during the validation process is inserted into a file header of its associated computer file.
 11. The system of claim 10, wherein said means for comparing includes: means for comparing the hash value of each computer file to be validated with the validation signature file, whereby: if the hash value of one of said computer files matches one or more elements of said validation signature file, a determination is made that that computer file contains an invalid executable file; and if the hash value of one of said computer files does not match one or more elements of said validation signature file, a determination is made that that computer file does not contain an invalid executable file.
 12. The system of claim 11, wherein said means for comparing further includes: means for comparing a write time stamp of each computer file to be validated with the hash value time stamp of its associated file signature, whereby: if said write time stamp and said hash value time stamp indicate the essentially simultaneous occurrence of the validation process associated with said write time stamp and the creation of the hash value in its associated file signature, then the hash value associated with the file being scanned is used for the validation process; and if said write time stamp and said hash value time stamp indicate the validation process associated with said write time stamp and the creation of the hash value in its associated file signature did not occur essentially simultaneously, then the hash value associated with the file being scanned is not used for the validation process, and a new hash value is created, on the fly, for use in conducting the validation process scan of that computer file.
 13. A computer program product for increasing the operational efficiency of a validation program configured to perform a validation process on one or more executable computer files, the computer program product comprising a computer-readable storage medium having computer-readable program code embodied in the medium, the computer-readable program code comprising: computer-readable program code that creates a file signature for each execution process associated with each of said computer files; computer-readable program code that compares each file signature with a validation signature during said validation process; and computer-readable program code that determines if an invalid executable file exists in any of said computer files, based on said comparison.
 14. The computer program product of claim 13, wherein said each file signature created during the validation process includes a hash value corresponding to its associated computer file.
 15. The computer program product of claim 14, wherein each file signature created during the validation process further includes a hash value time stamp corresponding to the time that the hash value corresponding to its associated computer file was created.
 16. The computer program product of claim 15, wherein each file signature created during the validation process is inserted into a file header of its associated computer file.
 17. The computer program product of claim 16, wherein said computer-readable program code that compares each file signature with a validation signature during said validation process includes: computer-readable program code that compares the hash value of each computer file to be validated with the validation signature file, whereby: if the hash value of one of said computer files matches one or more elements of said validation signature file, a determination is made that that computer file is invalid; and if the hash value of one of said computer files does not match one or more elements of said validation signature file, a determination is made that that computer file is not invalid.
 18. The computer program product of claim 17, wherein said computer-readable program code that compares each file signature with a validation signature during said validation process further includes: computer-readable program code that compares a write time stamp of each computer file to be validated with the hash value time stamp of its associated file signature, whereby: if said write time stamp and said hash value time stamp indicate the essentially simultaneous occurrence of the validation process associated with said write time stamp and the creation of the hash value in its associated file signature, then the hash value associated with the file being validated is used for the validation process; and if said write time stamp and said hash value time stamp indicate the validation process associated with said write time stamp and the creation of the hash value in its associated file signature did not occur essentially simultaneously, then the hash value associated with the file being validated is not used for the validation process, and a new hash value is created, on the fly, for use in conducting the validation process of that computer file. 